Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 10908300 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 915.5 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|
EngineSpeed is highly correlated with EngineAirInletPressure and 1 other fields | High correlation |
Fuel Rate is highly correlated with Engine Load and 2 other fields | High correlation |
Engine Load is highly correlated with Boost Pressure and 2 other fields | High correlation |
Boost Pressure is highly correlated with Engine Load and 2 other fields | High correlation |
EngineAirInletPressure is highly correlated with EngineSpeed and 3 other fields | High correlation |
AcceleratorPedalPos is highly correlated with Engine Load and 2 other fields | High correlation |
VehicleSpeed is highly correlated with EngineSpeed | High correlation |
Fuel Rate is highly skewed (γ1 = 43.53356777) | Skewed |
Timestamp has unique values | Unique |
LongitudAcc has 2127170 (19.5%) zeros | Zeros |
EngineSpeed has 231436 (2.1%) zeros | Zeros |
Fuel Rate has 2554907 (23.4%) zeros | Zeros |
Engine Load has 2567256 (23.5%) zeros | Zeros |
Boost Pressure has 1653306 (15.2%) zeros | Zeros |
AcceleratorPedalPos has 4389455 (40.2%) zeros | Zeros |
VehicleSpeed has 1466420 (13.4%) zeros | Zeros |
BrakePedalPos has 8997303 (82.5%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-22 21:23:49.537100 |
|---|---|
| Analysis finished | 2022-11-22 21:37:20.893759 |
| Duration | 13 minutes and 31.36 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 10908300 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.8106894 × 1010 |
| Minimum | 2.627644776 × 1010 |
|---|---|
| Maximum | 8.299465088 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 2.627644776 × 1010 |
|---|---|
| 5-th percentile | 2.840024754 × 1010 |
| Q1 | 5.040331817 × 1010 |
| median | 6.059781273 × 1010 |
| Q3 | 7.260733137 × 1010 |
| 95-th percentile | 8.109723985 × 1010 |
| Maximum | 8.299465088 × 1010 |
| Range | 5.671820312 × 1010 |
| Interquartile range (IQR) | 2.22040132 × 1010 |
Descriptive statistics
| Standard deviation | 1.728947594 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.2975460355 |
| Kurtosis | -1.027714024 |
| Mean | 5.8106894 × 1010 |
| Median Absolute Deviation (MAD) | 1.184510022 × 1010 |
| Skewness | -0.4739254843 |
| Sum | 6.338474318 × 1017 |
| Variance | 2.989259784 × 1020 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.627644776 × 1010 | 1 | < 0.1% |
| 6.955835094 × 1010 | 1 | < 0.1% |
| 6.955835293 × 1010 | 1 | < 0.1% |
| 6.955835493 × 1010 | 1 | < 0.1% |
| 6.955835596 × 1010 | 1 | < 0.1% |
| 6.955835686 × 1010 | 1 | < 0.1% |
| 6.955835794 × 1010 | 1 | < 0.1% |
| 6.955835986 × 1010 | 1 | < 0.1% |
| 6.955836094 × 1010 | 1 | < 0.1% |
| 6.955836194 × 1010 | 1 | < 0.1% |
| Other values (10908290) | 10908290 |
| Value | Count | Frequency (%) |
| 2.627644776 × 1010 | 1 | |
| 2.627644846 × 1010 | 1 | |
| 2.627644955 × 1010 | 1 | |
| 2.627645072 × 1010 | 1 | |
| 2.627645144 × 1010 | 1 | |
| 2.627645262 × 1010 | 1 | |
| 2.627645376 × 1010 | 1 | |
| 2.627645445 × 1010 | 1 | |
| 2.627645562 × 1010 | 1 | |
| 2.62764568 × 1010 | 1 |
| Value | Count | Frequency (%) |
| 8.299465088 × 1010 | 1 | |
| 8.299464979 × 1010 | 1 | |
| 8.299464867 × 1010 | 1 | |
| 8.299464799 × 1010 | 1 | |
| 8.299464682 × 1010 | 1 | |
| 8.299464567 × 1010 | 1 | |
| 8.299464499 × 1010 | 1 | |
| 8.29946438 × 1010 | 1 | |
| 8.299464266 × 1010 | 1 | |
| 8.299464193 × 1010 | 1 |
WetTankAirPressure
Real number (ℝ≥0)
| Distinct | 181 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.0609771 |
| Minimum | 0 |
|---|---|
| Maximum | 12.411 |
| Zeros | 2243 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.2046 |
| Q1 | 10.82515 |
| median | 11.1699 |
| Q3 | 11.51465 |
| 95-th percentile | 11.79045 |
| Maximum | 12.411 |
| Range | 12.411 |
| Interquartile range (IQR) | 0.6895 |
Descriptive statistics
| Standard deviation | 0.7514544128 |
|---|---|
| Coefficient of variation (CV) | 0.06793743502 |
| Kurtosis | 42.85687793 |
| Mean | 11.0609771 |
| Median Absolute Deviation (MAD) | 0.34475 |
| Skewness | -5.007766149 |
| Sum | 120656456.5 |
| Variance | 0.5646837345 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.51465 | 652909 | 6.0% |
| 11.4457 | 651738 | 6.0% |
| 11.5836 | 612920 | 5.6% |
| 11.37675 | 612235 | 5.6% |
| 10.8941 | 587552 | 5.4% |
| 10.82515 | 579800 | 5.3% |
| 10.96305 | 566822 | 5.2% |
| 11.23885 | 556980 | 5.1% |
| 11.1699 | 543814 | 5.0% |
| 11.10095 | 536611 | 4.9% |
| Other values (171) | 5006919 |
| Value | Count | Frequency (%) |
| 0 | 2243 | |
| 0.06895 | 17 | < 0.1% |
| 0.1379 | 16 | < 0.1% |
| 0.20685 | 13 | < 0.1% |
| 0.2758 | 11 | < 0.1% |
| 0.34475 | 17 | < 0.1% |
| 0.4137 | 12 | < 0.1% |
| 0.48265 | 18 | < 0.1% |
| 0.5516 | 46 | < 0.1% |
| 0.62055 | 18 | < 0.1% |
| Value | Count | Frequency (%) |
| 12.411 | 13 | < 0.1% |
| 12.34205 | 38 | < 0.1% |
| 12.2731 | 116 | < 0.1% |
| 12.20415 | 223 | < 0.1% |
| 12.1352 | 1407 | < 0.1% |
| 12.06625 | 9893 | 0.1% |
| 11.9973 | 44284 | 0.4% |
| 11.92835 | 93494 | 0.9% |
| 11.8594 | 215173 | |
| 11.79045 | 325162 |
| Distinct | 120 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.03205364722 |
| Minimum | -10.1 |
|---|---|
| Maximum | 13 |
| Zeros | 2127170 |
| Zeros (%) | 19.5% |
| Negative | 4605319 |
| Negative (%) | 42.2% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | -10.1 |
|---|---|
| 5-th percentile | -1.1 |
| Q1 | -0.3 |
| median | 0 |
| Q3 | 0.3 |
| 95-th percentile | 0.9 |
| Maximum | 13 |
| Range | 23.1 |
| Interquartile range (IQR) | 0.6 |
Descriptive statistics
| Standard deviation | 0.6739510985 |
|---|---|
| Coefficient of variation (CV) | -21.02572272 |
| Kurtosis | 90.5194209 |
| Mean | -0.03205364722 |
| Median Absolute Deviation (MAD) | 0.3 |
| Skewness | 4.412395045 |
| Sum | -349650.8 |
| Variance | 0.4542100832 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2127170 | |
| -0.1 | 735450 | 6.7% |
| -0.2 | 691634 | 6.3% |
| 0.1 | 684007 | 6.3% |
| 0.2 | 610760 | 5.6% |
| -0.3 | 591412 | 5.4% |
| 0.3 | 537225 | 4.9% |
| -0.4 | 504214 | 4.6% |
| 0.4 | 461032 | 4.2% |
| 0.5 | 396778 | 3.6% |
| Other values (110) | 3568618 |
| Value | Count | Frequency (%) |
| -10.1 | 1 | |
| -9.7 | 1 | |
| -8.7 | 1 | |
| -7.6 | 2 | |
| -7.1 | 2 | |
| -6.8 | 1 | |
| -6.7 | 1 | |
| -6.5 | 1 | |
| -6.4 | 2 | |
| -6.3 | 1 |
| Value | Count | Frequency (%) |
| 13 | 3402 | |
| 12.9 | 3801 | |
| 5.4 | 1 | < 0.1% |
| 5.1 | 1 | < 0.1% |
| 4.8 | 1 | < 0.1% |
| 4.7 | 2 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 4.4 | 4 | < 0.1% |
| 4.3 | 2 | < 0.1% |
| 4.2 | 5 | < 0.1% |
| Distinct | 14429 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1073.753356 |
| Minimum | 0 |
|---|---|
| Maximum | 8191.875 |
| Zeros | 231436 |
| Zeros (%) | 2.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 549.5 |
| Q1 | 933.625 |
| median | 1163.125 |
| Q3 | 1286.625 |
| 95-th percentile | 1447.875 |
| Maximum | 8191.875 |
| Range | 8191.875 |
| Interquartile range (IQR) | 353 |
Descriptive statistics
| Standard deviation | 326.1297299 |
|---|---|
| Coefficient of variation (CV) | 0.3037287177 |
| Kurtosis | 9.127563814 |
| Mean | 1073.753356 |
| Median Absolute Deviation (MAD) | 149 |
| Skewness | -0.640048706 |
| Sum | 1.171282373 × 1010 |
| Variance | 106360.6008 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 231436 | 2.1% |
| 599.875 | 10276 | 0.1% |
| 600.5 | 10273 | 0.1% |
| 600.375 | 10199 | 0.1% |
| 600 | 10182 | 0.1% |
| 600.125 | 10172 | 0.1% |
| 600.75 | 10161 | 0.1% |
| 600.25 | 10139 | 0.1% |
| 599.75 | 10100 | 0.1% |
| 600.875 | 10094 | 0.1% |
| Other values (14419) | 10585268 |
| Value | Count | Frequency (%) |
| 0 | 231436 | |
| 15.625 | 1 | < 0.1% |
| 16.5 | 1 | < 0.1% |
| 17.25 | 1 | < 0.1% |
| 17.375 | 1 | < 0.1% |
| 19.125 | 1 | < 0.1% |
| 20.25 | 2 | < 0.1% |
| 20.875 | 1 | < 0.1% |
| 21.75 | 1 | < 0.1% |
| 21.875 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8191.875 | 395 | |
| 2151.5 | 1 | < 0.1% |
| 2138.625 | 1 | < 0.1% |
| 2133.375 | 1 | < 0.1% |
| 2132.625 | 1 | < 0.1% |
| 2126.125 | 2 | < 0.1% |
| 2125.625 | 2 | < 0.1% |
| 2124.75 | 1 | < 0.1% |
| 2124 | 1 | < 0.1% |
| 2123.75 | 1 | < 0.1% |
| Distinct | 1108 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.72080786 |
| Minimum | 0 |
|---|---|
| Maximum | 3876.198645 |
| Zeros | 2554907 |
| Zeros (%) | 23.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.064646 |
| median | 8.576315 |
| Q3 | 21.766096 |
| 95-th percentile | 46.312101 |
| Maximum | 3876.198645 |
| Range | 3876.198645 |
| Interquartile range (IQR) | 20.70145 |
Descriptive statistics
| Standard deviation | 85.92078612 |
|---|---|
| Coefficient of variation (CV) | 5.465417993 |
| Kurtosis | 1952.6881 |
| Mean | 15.72080786 |
| Median Absolute Deviation (MAD) | 8.576315 |
| Skewness | 43.53356777 |
| Sum | 171487288.4 |
| Variance | 7382.381488 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2554907 | 23.4% |
| 3.312232 | 65396 | 0.6% |
| 3.371379 | 63773 | 0.6% |
| 3.253085 | 61747 | 0.6% |
| 3.430526 | 60585 | 0.6% |
| 3.489673 | 56004 | 0.5% |
| 3.193938 | 55414 | 0.5% |
| 3.54882 | 52889 | 0.5% |
| 3.962849 | 52520 | 0.5% |
| 4.021996 | 52294 | 0.5% |
| Other values (1098) | 7832771 |
| Value | Count | Frequency (%) |
| 0 | 2554907 | |
| 0.059147 | 10599 | 0.1% |
| 0.118294 | 10312 | 0.1% |
| 0.177441 | 11789 | 0.1% |
| 0.236588 | 14361 | 0.1% |
| 0.295735 | 13892 | 0.1% |
| 0.354882 | 12336 | 0.1% |
| 0.414029 | 11726 | 0.1% |
| 0.473176 | 10366 | 0.1% |
| 0.532323 | 8726 | 0.1% |
| Value | Count | Frequency (%) |
| 3876.198645 | 5231 | |
| 3870.224798 | 1 | < 0.1% |
| 3860.879572 | 1 | < 0.1% |
| 3783.278708 | 1 | < 0.1% |
| 3025.132462 | 1 | < 0.1% |
| 2840.179793 | 1 | < 0.1% |
| 2004.787565 | 1 | < 0.1% |
| 65.0617 | 24 | < 0.1% |
| 65.002553 | 22 | < 0.1% |
| 64.943406 | 38 | < 0.1% |
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.36381581 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 2567256 |
| Zeros (%) | 23.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 24.5 |
| Q3 | 45.5 |
| 95-th percentile | 89.5 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 42.5 |
Descriptive statistics
| Standard deviation | 27.65931726 |
|---|---|
| Coefficient of variation (CV) | 0.9109302148 |
| Kurtosis | -0.0496729248 |
| Mean | 30.36381581 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 0.8495695205 |
| Sum | 331217612 |
| Variance | 765.0378312 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 2567256 | 23.5% |
| 100 | 287812 | 2.6% |
| 21 | 125412 | 1.1% |
| 20.5 | 125030 | 1.1% |
| 21.5 | 124726 | 1.1% |
| 22.5 | 124092 | 1.1% |
| 23 | 122595 | 1.1% |
| 22 | 122413 | 1.1% |
| 23.5 | 121785 | 1.1% |
| 20 | 119480 | 1.1% |
| Other values (191) | 7067699 |
| Value | Count | Frequency (%) |
| 0 | 2567256 | |
| 0.5 | 42708 | 0.4% |
| 1 | 33287 | 0.3% |
| 1.5 | 26477 | 0.2% |
| 2 | 24737 | 0.2% |
| 2.5 | 21951 | 0.2% |
| 3 | 23528 | 0.2% |
| 3.5 | 22103 | 0.2% |
| 4 | 24212 | 0.2% |
| 4.5 | 22611 | 0.2% |
| Value | Count | Frequency (%) |
| 100 | 287812 | |
| 99.5 | 9808 | 0.1% |
| 99 | 11572 | 0.1% |
| 98.5 | 14015 | 0.1% |
| 98 | 12458 | 0.1% |
| 97.5 | 12369 | 0.1% |
| 97 | 11842 | 0.1% |
| 96.5 | 11591 | 0.1% |
| 96 | 12062 | 0.1% |
| 95.5 | 11411 | 0.1% |
| Distinct | 188 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2223742027 |
| Minimum | 0 |
|---|---|
| Maximum | 1.611566 |
| Zeros | 1653306 |
| Zeros (%) | 15.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.051708 |
| median | 0.120652 |
| Q3 | 0.30163 |
| 95-th percentile | 0.784238 |
| Maximum | 1.611566 |
| Range | 1.611566 |
| Interquartile range (IQR) | 0.249922 |
Descriptive statistics
| Standard deviation | 0.2616009084 |
|---|---|
| Coefficient of variation (CV) | 1.176399534 |
| Kurtosis | 4.157642348 |
| Mean | 0.2223742027 |
| Median Absolute Deviation (MAD) | 0.112034 |
| Skewness | 1.962960112 |
| Sum | 2425724.515 |
| Variance | 0.06843503528 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1653306 | 15.2% |
| 0.08618 | 436050 | 4.0% |
| 0.077562 | 428160 | 3.9% |
| 0.094798 | 396454 | 3.6% |
| 0.068944 | 370214 | 3.4% |
| 0.103416 | 334827 | 3.1% |
| 0.060326 | 288077 | 2.6% |
| 0.112034 | 270791 | 2.5% |
| 0.051708 | 228753 | 2.1% |
| 0.120652 | 223067 | 2.0% |
| Other values (178) | 6278601 |
| Value | Count | Frequency (%) |
| 0 | 1653306 | |
| 0.008618 | 212132 | 1.9% |
| 0.017236 | 158043 | 1.4% |
| 0.025854 | 160661 | 1.5% |
| 0.034472 | 181503 | 1.7% |
| 0.04309 | 199324 | 1.8% |
| 0.051708 | 228753 | 2.1% |
| 0.060326 | 288077 | 2.6% |
| 0.068944 | 370214 | 3.4% |
| 0.077562 | 428160 | 3.9% |
| Value | Count | Frequency (%) |
| 1.611566 | 3 | < 0.1% |
| 1.602948 | 17 | < 0.1% |
| 1.59433 | 15 | < 0.1% |
| 1.585712 | 12 | < 0.1% |
| 1.577094 | 15 | < 0.1% |
| 1.568476 | 38 | |
| 1.559858 | 25 | < 0.1% |
| 1.55124 | 29 | < 0.1% |
| 1.542622 | 68 | |
| 1.534004 | 76 |
| Distinct | 98 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 123.7234423 |
| Minimum | 32 |
|---|---|
| Maximum | 510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 100 |
| Q1 | 106 |
| median | 114 |
| Q3 | 132 |
| 95-th percentile | 180 |
| Maximum | 510 |
| Range | 478 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 26.37856635 |
|---|---|
| Coefficient of variation (CV) | 0.2132058877 |
| Kurtosis | 5.683933293 |
| Mean | 123.7234423 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.029374556 |
| Sum | 1349612426 |
| Variance | 695.8287627 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 110 | 958940 | 8.8% |
| 102 | 920858 | 8.4% |
| 100 | 766118 | 7.0% |
| 112 | 754953 | 6.9% |
| 108 | 745740 | 6.8% |
| 114 | 509557 | 4.7% |
| 106 | 487704 | 4.5% |
| 104 | 434920 | 4.0% |
| 116 | 408424 | 3.7% |
| 118 | 349066 | 3.2% |
| Other values (88) | 4572020 |
| Value | Count | Frequency (%) |
| 32 | 3 | < 0.1% |
| 34 | 33 | |
| 50 | 15 | |
| 52 | 15 | |
| 64 | 1 | < 0.1% |
| 66 | 18 | |
| 68 | 32 | |
| 70 | 4 | < 0.1% |
| 86 | 4 | < 0.1% |
| 92 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 510 | 384 | < 0.1% |
| 508 | 25 | < 0.1% |
| 264 | 1 | < 0.1% |
| 262 | 30 | < 0.1% |
| 260 | 39 | < 0.1% |
| 258 | 79 | < 0.1% |
| 256 | 106 | < 0.1% |
| 254 | 261 | < 0.1% |
| 252 | 563 | |
| 250 | 1074 |
| Distinct | 251 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.03600088 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 4389455 |
| Zeros (%) | 40.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 40.4 |
| Q3 | 66.8 |
| 95-th percentile | 97.6 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 66.8 |
Descriptive statistics
| Standard deviation | 35.03854349 |
|---|---|
| Coefficient of variation (CV) | 0.9460671417 |
| Kurtosis | -1.427659708 |
| Mean | 37.03600088 |
| Median Absolute Deviation (MAD) | 40.4 |
| Skewness | 0.2296627301 |
| Sum | 403999808.4 |
| Variance | 1227.69953 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 4389455 | |
| 100 | 476280 | 4.4% |
| 62 | 55335 | 0.5% |
| 60.8 | 53298 | 0.5% |
| 59.6 | 53060 | 0.5% |
| 59.2 | 52990 | 0.5% |
| 62.8 | 51960 | 0.5% |
| 64.4 | 51632 | 0.5% |
| 57.6 | 51280 | 0.5% |
| 61.2 | 51143 | 0.5% |
| Other values (241) | 5621867 |
| Value | Count | Frequency (%) |
| 0 | 4389455 | |
| 0.4 | 4766 | < 0.1% |
| 0.8 | 4753 | < 0.1% |
| 1.2 | 4801 | < 0.1% |
| 1.6 | 4513 | < 0.1% |
| 2 | 4587 | < 0.1% |
| 2.4 | 4475 | < 0.1% |
| 2.8 | 4715 | < 0.1% |
| 3.2 | 4972 | < 0.1% |
| 3.6 | 4476 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 476280 | |
| 99.6 | 12331 | 0.1% |
| 99.2 | 12148 | 0.1% |
| 98.8 | 12924 | 0.1% |
| 98.4 | 12157 | 0.1% |
| 98 | 12995 | 0.1% |
| 97.6 | 12972 | 0.1% |
| 97.2 | 13883 | 0.1% |
| 96.8 | 13390 | 0.1% |
| 96.4 | 14260 | 0.1% |
| Distinct | 24448 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.4453664 |
| Minimum | 0 |
|---|---|
| Maximum | 255.97971 |
| Zeros | 1466420 |
| Zeros (%) | 13.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18.08478 |
| median | 42.020748 |
| Q3 | 60.355512 |
| 95-th percentile | 77.245056 |
| Maximum | 255.97971 |
| Range | 255.97971 |
| Interquartile range (IQR) | 42.270732 |
Descriptive statistics
| Standard deviation | 25.3683214 |
|---|---|
| Coefficient of variation (CV) | 0.6431255103 |
| Kurtosis | -0.6116311278 |
| Mean | 39.4453664 |
| Median Absolute Deviation (MAD) | 20.91663 |
| Skewness | -0.04945009513 |
| Sum | 430281890.3 |
| Variance | 643.5517305 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1466420 | 13.4% |
| 69.042456 | 1104 | < 0.1% |
| 69.003396 | 1075 | < 0.1% |
| 69.011208 | 1051 | < 0.1% |
| 68.983866 | 1037 | < 0.1% |
| 69.112764 | 1034 | < 0.1% |
| 70.022862 | 1034 | < 0.1% |
| 68.964336 | 1030 | < 0.1% |
| 69.124482 | 1029 | < 0.1% |
| 68.944806 | 1028 | < 0.1% |
| Other values (24438) | 9432458 |
| Value | Count | Frequency (%) |
| 0 | 1466420 | |
| 0.999936 | 111 | < 0.1% |
| 1.003842 | 114 | < 0.1% |
| 1.007748 | 97 | < 0.1% |
| 1.011654 | 122 | < 0.1% |
| 1.01556 | 112 | < 0.1% |
| 1.019466 | 122 | < 0.1% |
| 1.023372 | 122 | < 0.1% |
| 1.027278 | 106 | < 0.1% |
| 1.031184 | 104 | < 0.1% |
| Value | Count | Frequency (%) |
| 255.97971 | 392 | |
| 255.975804 | 701 | |
| 106.872066 | 1 | < 0.1% |
| 106.71192 | 1 | < 0.1% |
| 106.532244 | 1 | < 0.1% |
| 106.372098 | 1 | < 0.1% |
| 106.34085 | 1 | < 0.1% |
| 106.290072 | 1 | < 0.1% |
| 106.28226 | 1 | < 0.1% |
| 105.610428 | 1 | < 0.1% |
| Distinct | 239 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.195319747 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 8997303 |
| Zeros (%) | 82.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 83.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 21.6 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.538444664 |
|---|---|
| Coefficient of variation (CV) | 2.359214495 |
| Kurtosis | 4.886594089 |
| Mean | 3.195319747 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.31437582 |
| Sum | 34855506.4 |
| Variance | 56.82814796 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 8997303 | |
| 15.6 | 100349 | 0.9% |
| 17.2 | 97817 | 0.9% |
| 16 | 83924 | 0.8% |
| 16.4 | 81978 | 0.8% |
| 16.8 | 74605 | 0.7% |
| 17.6 | 73225 | 0.7% |
| 15.2 | 72021 | 0.7% |
| 19.2 | 42756 | 0.4% |
| 18 | 40508 | 0.4% |
| Other values (229) | 1243814 | 11.4% |
| Value | Count | Frequency (%) |
| 0 | 8997303 | |
| 0.4 | 12071 | 0.1% |
| 0.8 | 8297 | 0.1% |
| 1.2 | 7850 | 0.1% |
| 1.6 | 7856 | 0.1% |
| 2 | 7415 | 0.1% |
| 2.4 | 7218 | 0.1% |
| 2.8 | 6913 | 0.1% |
| 3.2 | 6504 | 0.1% |
| 3.6 | 6708 | 0.1% |
| Value | Count | Frequency (%) |
| 98 | 14 | |
| 97.6 | 31 | |
| 97.2 | 2 | < 0.1% |
| 96.4 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95.6 | 1 | < 0.1% |
| 95.2 | 1 | < 0.1% |
| 94.8 | 4 | < 0.1% |
| 93.6 | 2 | < 0.1% |
| 93.2 | 2 | < 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 1 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 2 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 3 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 4 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 5 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 6 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 7 | 2.627645e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 8 | 2.627646e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
| 9 | 2.627646e+10 | 4.2749 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 | 0.0 | 0.0 |
Last rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 10908290 | 8.299464e+10 | 11.79045 | -0.3 | 1126.125 | 3.548820 | 8.5 | 0.068944 | 108.0 | 34.4 | 12.983544 | 0.0 |
| 10908291 | 8.299464e+10 | 11.79045 | -1.1 | 1079.625 | 3.489673 | 8.0 | 0.060326 | 108.0 | 16.8 | 12.401550 | 0.0 |
| 10908292 | 8.299464e+10 | 11.79045 | -0.7 | 952.750 | 0.000000 | 0.0 | 0.060326 | 108.0 | 0.0 | 11.139912 | 17.6 |
| 10908293 | 8.299464e+10 | 11.72150 | -0.8 | 684.250 | 1.892704 | 8.0 | 0.051708 | 106.0 | 0.0 | 8.327592 | 16.4 |
| 10908294 | 8.299465e+10 | 11.72150 | -1.8 | 566.750 | 3.548820 | 22.0 | 0.034472 | 104.0 | 0.0 | 5.417622 | 13.6 |
| 10908295 | 8.299465e+10 | 11.65255 | -0.2 | 596.500 | 4.081143 | 24.0 | 0.008618 | 102.0 | 0.0 | 2.331882 | 0.0 |
| 10908296 | 8.299465e+10 | 11.58360 | -0.4 | 616.750 | 3.430526 | 19.5 | 0.000000 | 102.0 | 0.0 | 1.796760 | 11.2 |
| 10908297 | 8.299465e+10 | 11.51465 | 0.0 | 573.000 | 3.726261 | 20.5 | 0.000000 | 102.0 | 0.0 | 0.000000 | 18.4 |
| 10908298 | 8.299465e+10 | 11.44570 | 0.0 | 617.500 | 3.726261 | 20.0 | 0.000000 | 102.0 | 0.0 | 0.000000 | 0.0 |
| 10908299 | 8.299465e+10 | 11.44570 | 0.0 | 589.625 | 4.199437 | 24.5 | 0.000000 | 102.0 | 0.0 | 0.000000 | 0.0 |